Estimating the indexability of multimedia descriptors for similarity searching
Abstract
This study examines properties of data sets representing public domain audio and visual content and how those properties relate to the indexability of the data. The data analysis considers the pairwise distance distributions and various techniques for estimating the true intrinsic dimensionality of the studied data. We also present our own alternative approach to dimensionality estimation. These results are contrasted with the indexability results obtained using three indexing techniques: M-Tree, LSH, and the hierarchical k-means tree.
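As an illustration of how a pairwise distance distribution can yield a dimensionality estimate, the sketch below uses one well-known estimator from the similarity-search literature, rho = mu^2 / (2 * sigma^2), where mu and sigma^2 are the mean and variance of the pairwise distances. This is an assumption chosen for illustration: the abstract does not say which estimators the study compares, and this is not the authors' own alternative. The function names and the synthetic Gaussian data are hypothetical.

```python
import itertools
import math
import random
import statistics


def pairwise_distances(points):
    """Euclidean distance for every unordered pair of points."""
    return [math.dist(p, q) for p, q in itertools.combinations(points, 2)]


def intrinsic_dimensionality(distances):
    """Estimate rho = mu^2 / (2 * sigma^2) from a list of pairwise distances."""
    mu = statistics.fmean(distances)
    sigma2 = statistics.pvariance(distances)
    return mu * mu / (2.0 * sigma2)


def gaussian_points(n, dim, rng):
    """Synthetic stand-in for descriptors: n points drawn from N(0, I) in dim dimensions."""
    return [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n)]


rng = random.Random(0)
dists_low = pairwise_distances(gaussian_points(200, 2, rng))
dists_high = pairwise_distances(gaussian_points(200, 32, rng))

rho_low = intrinsic_dimensionality(dists_low)
rho_high = intrinsic_dimensionality(dists_high)

print(f"2-D data:  rho = {rho_low:.2f}")
print(f"32-D data: rho = {rho_high:.2f}")
```

A larger rho means the distances are concentrated around their mean, which generally makes distance-based pruning in an index less effective.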
Similar resources
Text Based Approaches for Content Based Image Retrieval in a P2P Network
The tremendous growth of digital multimedia content on the web requires scalable, efficient, and effective information retrieval mechanisms. Handling such large collections of data in a centralized way requires costly high-bandwidth connectivity and powerful servers. This establishes the need for distributed architectures, such as peer-to-peer systems, that allow sharing of data management and s...
Issues Concerning Dimensionality and Similarity Search
Effectiveness and efficiency are two important concerns in using multimedia descriptors to search and access database items. Both are affected by the dimensionality of the descriptors. While higher dimensionality generally increases effectiveness, it drastically reduces efficiency of storage and searching. With regard to effectiveness, relevance feedback is known to be a useful tool to squeeze ...
Proximity-Based Order-Respecting Intersection for Searching in Image Databases
As the volume of non-textual data, such as images and other multimedia data, available on the Internet increases, the issue of identifying data items based on query containment rather than query equality is becoming more and more important. In this paper, we propose a solution to this problem. We assume local descriptors are extracted from data items, so the aforementioned problem reduces to findi...
Determination of critical properties of Alkanes derivatives using multiple linear regression
This study presents some mathematical methods for estimating the critical properties of 40 different types of alkanes and their derivatives including critical temperature, critical pressure and critical volume. This algorithm used QSPR modeling based on graph theory, several structural indices, and geometric descriptors of chemical compounds. Multiple linear regression was used to estimate the ...
Issues in Using Knowledge to Perform Similarity Searching in Multimedia Databases without Redundant Data Objects
This paper presents a possible way that artificial intelligence can be used to perform searching in a multimedia database management system without redundant data. Specifically, it finds the nearest neighbors of some query object without computing the distance between it and every other item in the database. Knowledge about the data in the system is required to perform searching. This knowledge...